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DETAILED ACTION 
Response to Arguments 

1 . Applicant's arguments, see page 8, filed December 20, 2005, with respect to the 
rejection(s) of claim(s) 1,11, and 21 under 35 U.S.C. 102(e) have been fully considered 
and are persuasive. Therefore, the rejection has been withdrawn. However, upon 
further consideration, a new ground(s) of rejection is made in view of Nitta et al. (U.S. 
Patent 4,881,266). 

Up to this point in prosecution of the present application, the Examiner's 
interpretation of "prosodic features", as used in independent claims 1,11, and 21 , has 
been primarily focused on the "pitch" features of the input speech signal. This because 
the Applicant's specification emphasizes the need for pitch analysis in endpointing 
speech signals to overcome the prior art problems of only using pause and duration 
analysis for endpointing speech signals (see Description of Related Art section of 
Applicant's specification). 

In response to the Applicant's arguments filed December 20, 2005, the examiner 
concedes that Lennig does not disclose providing an endpoint signal and the speech 
signal to a speech processing application, as required by independent claims 1,11, and 
21. Lennig instead provides previously endpointed speech signals to a speech 
processing application, and does not disclose providing both the speech signal and a 
separate endpoint signal to facilitate subsequent processing of the speech signal. 
Therefore, the rejections of claims 1,11, and 21 are withdrawn. 



Application/Control Number: 09/829,831 Page 3 

Art Unit: 2655 

However, the term "prosod/' is defined by the Applicants specification as "the 
way speakers modulate the timing, pitch, and loudness of phones words and phrases" 
(see page 2, paragraph 6), The term "prosodic features" as used in independent claims 
1,11, and 21 , therefore, is not limited to pitch features, but includes other features as 
well (i.e. timing and loudness). This is further emphasized by the fact that claims which 
depend from independent claims 1,11, and 21 , such as claims 2 and 3, define the 
extraction of prosodic features as processing pitch information (as in claim 2) as well as 
performing pause analysis (as in claim 3). 

2. Given this interpretation of the term "prosodic features", new grounds of rejection 
are made for claims 1,11, and 21 under 35 U.S.C. 102(b) as being anticipated by Nitta 
et al. (U.S. Patent 4,881,266), because Nitta et al, disclose an endpoint detection 
scheme wherein loudness and duration analysis of an input signal are used for endpoint 
detection. Further, the speech signal as well as a separate endpoint signal are provided 
to a speech processor for subsequent processing. 

Allowable Subject Matter 

3. Claims 2 and 12 are objected to as being dependent upon a rejected base claim, 
but would be allowable if rewritten in independent form including all of the limitations of 
the base claim and any intervening claims. 

As explained above, the Examiner concedes that Lennig does not disclose 
providing an endpoint signal and the speech signal to a speech processing application, 
as required by independent claims 1,11, and 21 . Further, it would not have been 
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obvious to one of ordinary skill in the art to modify Lennig to provide an endpoint signal 
in addition to the speech signal to a speech processing application. 

Nitta et al. disclose using prosodic features to identify a speech endpoint (timing 
and loudness), but provide no suggestion that pitch information be analyzed to 
determine a speech endpoint. 

Claims 3-6 and 13-16 further limit claims 2 and 12 and thus, would also be 
allowable. 

Claim Rejections - 35 USC § 102 

4. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or In public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 
States. 

5. Claims 1,7-11, and 1 7-21 are rejected under 35 U.S.C. 1 02(b) as being 
anticipated by Nitta et al. (U.S. Patent 4,881,266). 

In regard to claims 1,11, and 21 , Nitta et al. disclose a method, apparatus, and 
electronic storage medium for processing a speech signal comprising: 

extracting prosodic features from a speech signal (Fig. 4, sound powers are 
extracted from the input speech signal, column 4, lines 41-51 ; the loudness of an input 
speech signal is a function of the power of the input speech signal); 

modeling the prosodic features to identify at least one speech endpoint (a word 
boundary is determined when the sound power drops below a predetermined threshold 
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for a predetermined number of frames, column 6, lines 30-42; this modeling, therefore, 
is related to the prosodic features of loudness and timing); 

producing an endpoint signal corresponding to the occurrence of at least one 
endpoint (word frame intervals are sent to the pattern match detector. Fig. 3, 3A, 
column 7, lines 46-65); and 

providing the endpoint signal and the speech signal to a speech processing 
application to facilitate subsequent processing of the speech signal (word frame 
intervals are sent to the pattern match detector. Fig. 3, 3A, column 7, lines 46-65; in 
addition to the LPC coefficients of the input speech signal, see Fig. 3 and column 7, line 
66 to column 8, line 11). 

In regard to claims 7 and 17, Nitta et al. disclose the producing step comprises 
generating a posterior probability regarding the at least one speech endpoint (Fig. 5, 
likelihood calculator 24 determines the probability of the frame intervals, column 7, lines 
46-65). 

In regard to claims 8 and 18, Nitta et al. disclose the posterior probability includes 
a probability that a user has completed an utterance (likelihood calculator 24 determines 
the probability of ending candidates B1 and B2, column 7, lines 46-65). 

In regard to claims 9 and 19, Nitta et al. disclose the posterior probability is 
continuously updated as the speech signal is processed (incoming groups of frames are 
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continuously compared to the sound power threshold to update the likelihood of the 
ending points, column 7, lines 29-44). 

In regard to claims 10 and 20, Nitta et al. disclose executing a speech recognition 
routine for processing the speech signal using the at least one speech endpoint (Fig. 3, 
pattern match detector 3A and discriminator 5A recognize the input speech, column 4, 
lines 19-30). 

Conclusion 

6. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Brian L. Albertalli whose telephone number is (571) 272- 
7616. The examiner can normally be reached on Mon - Fri, 8:00 AM - 5:30 PM, every 
second Fri off. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, David Hudspeth can be reached on (571) 272-7843. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR, 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 
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